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Abstract 

We investigate exceedances of the process over a sufficiently high threshold. The 
exceedances determine the risk of hazardous events like climate catastrophes, huge 
insurance claims, the loss and delay in telecommunication networks. Due to de¬ 
pendence such exceedances tend to occur in clusters. Cluster structure of so¬ 
cial networks is caused by dependence (social relationships and interests) between 
nodes and possibly heavy-tailed distributions of the node degrees. A minimal time 
to reach a large node determines the first hitting time. We derive asymptotically 
equivalent distribution and a limit expectation of the first hitting time to exceed 
the threshold u n as sample size n tends to infinity. The results can be extended 
to the second and, generally, to kth {k > 2) hitting times. 

1. Introduction 

Let {X n } n >x be a stationary process with marginal distribution function F(x ) and 
M n = max{Xi, ..., X n }. We investigate the exceedances of the process over a 
sufficiently high threshold u. Due to dependence such exceedances tend to occur 
in clusters. Let us consider the inter-cluster size 

Ti(u ) = minjj > 1 : Mij < u,Xj + i > u\Xi > u}, (1) 

i.e. the number of inter-arrivals of observations running under the threshold be¬ 
tween two consecutive exceedances, where M i 3 = max{X 2 , ...,Xj}, M ltl = —oo. 
Let T*(u n ) be a first hitting time to exceed the threshold u n . We get 

P{T*(u n ) = j + 1} = P{Mj < u n , X j+ i > u n }, (2) 

j = 0,1,2,..., Mq = -oo. 

The necessity to evaluate quantiles of the first hitting time and its mean is arisen 
in many applications. In social networks it is important to compare sampling 
strategies (Avrachenkov et al. (2012), Avrachenkov et al. (2014a), Avrachenkov 
et at. (2014b), Chul-Ho Lee et al. (2012)) like random walks, Metropolis-Hastings 
Markov chains, Page Ranks and other with regard to how quickly they allow to 
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reach a node with a large degree that is the number of links with other nodes. It 
is important to investigate the first hitting time of significant nodes since it allows 
us to disseminate advertisement or collect opinions more effectively within the 
clusters surrounded such nodes. Similar problem is required in telecommunication 
peer-to-peer networks to find a node with a large number of peers, Dan and Fodor 
(2009), Markovich (2013). 

Definition 1.1. (Leadbetter et al. (1983), p.53) The stationary sequence {X n } n >i 
is said to have extremal index 6 e [0,1] if for each 0 < r < oo there is a sequence 
of real numbers u n = u n (r) such that 


lim n( 1 — F(u n )) = r and 

n—> oo 

lim P{M n < u n } = e~ r6 

n—>■ oo 

hold. 


( 3 ) 

( 4 ) 


The extremal index 6 of {X n } relates to the first hitting time T*(u n ), Roberts et al. 
(2006). Really, since u n is selected according to (|3]) it follows that P{X n > u n } is 
asymptotically equivalent to 1/n. Notice, that P{M k < u n } = P{T*{u n ) > k}. 
Hence, substituting r by (|3j) we get from (j4j) 


P{T*(u n )/n > k/n} ~ e - ekp {Xn>u n } _ e ~ek/n 


and 

lim P(T*(u„)/n > x) = e~ 9x 

n—> oo 

for positive x. It follows 


lim E(T* (u n ) / n) = 1/6. (5) 

n—> oo 

This implies, the smaller 6, the longer it takes to reach an observation with a large 
value. 

Using achievements regarding the asymptotically equivalent geometric distribution 
of Ti(x Pn ) derived in (Theorem 11.11 Markovich (2014)), where the (1 — p n )th 
quantile x Pn of {X n } is taken as u n , we derive an asymptotically equivalent dis¬ 
tribution of the first hitting time and its limiting expectation that specifies ((5]) in 
Section [2 

We use the mixing condition proposed in Ferro and Segers (2003) 

Oi n ,q{u) = max sup \P{B\A) — P(B)\ = o(l), n—t oo, (6) 

1 <k<n—q 

where for real u and integers 1 < k < l, T k ,i(u) is the a-field generated by the 
events {Xi > u}, k < i < l and the supremum is taken over all A e Ti^-u) with 
P(A) > 0 and B e fFk+ q ,n{u) and k, q are positive integers. 

We use the following partition 


1 < K fl < k* >2 < k* n<3 < k* nA < j, j = 2 , 3 ,..., ( 7 ) 

Kfi = !. fc n,5 = j’ K ,i = \jkn,i/n\ + 1 ,i = {1,2}, k* nA = j - [ jk nA /n ], k* nA = 
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j - \jk n ,3/n], 0 of the interval [1, j] for a fixed j, where positive integers {k ni } 
are such that 


{k n ,i -1 = o(k Hji ), i G {2, 3,4}}, k nA = o(n). 


( 8 ) 


Theorem 1.1. (Markovich (2014)) Let {X n } n >i be a stationary process with the 
extremal index 6. Let {x Pn } and {x p *} be sequences of quantiles of Xi of the 
levels {1 — p n } and {1 — p* n }, respectively H those satisfy the conditions (0|) and (d)) 
if Un is replaced by x Pn or by x p * and, q n = 1 - p n , q* n _=_ 1 - P* n , P* n = (1 - q 9 n ) 1/e ■ 
Let positive integers {fc* J, i — 0, 5, and { k ni \, i = 1,4, be respectively as in (0|) 
and (B), p* ni = o(A„,i) f A np = k* Ll - = o(p^), i G {1,2, ...,5} and 

{Pn 3 } he an increasing sequence, such that 


a n( x Pn) — ma x{«fc;,4< 1 ; Q! fc; i 3< 2 ; «a„, 3,5* !3 ; a j+i-fe*, 2 ,< i4 ; 


(9) 


and 


0i j+ 1 2 K,4-K, 1 “ °( a 0 ( 1 °) 

hold as n —>■ 00 , where a nA = OL n , q (x Pn ) is determined by §&). Then it holds for 

J > 2 


lim = i}/(p n (l -Pn) 0 1)0 ) = 1, (11) 

n—>-oo 

The achievements can be similarly extended to the second and, generally, to /rth 
(fc > 2) hitting times. The paper is organized as follows. In Section |2]we derive the 
asymptotically equivalent distribution and the limit expectation of the first hitting 
time to exceed sufficiently high threshold. Proofs are given in Section [3] 


2. Distribution and expectation of the first hitting time 

Theorem 2.1. Let all conditions of Theorem ll.ll are satisfied. We assume 

sup p n E{Ti{x Pn )) < 00 . (12) 

n 

Then we get 

lim nP{T*{u n ) = n} = e~ dT /9, (13) 

71—>00 

P{T‘(x p „)=i} ~ y(l - 0 p n y-\ (14) 

as n —* 00 . 

1 [a^] represents the integer part of the real number x. 

2 F{.X Pn ) = P{X 1 > Xpn }= Pn- 


3 



Expression (fT4l) implies that P{T*(x Pn ) — j} 


r 


P{Tg = j}, where 


_ f X, with probability 1 /6> 2 ; 

9 | 0, with probability 1 — I/O 2 

holds and x has the geometric distribution with probability p n 6. 


Remark 2.1. The condition /fl2l) provides a uniform convergence of the range 
E7= 1 p {Ti (x Pn ) = .]} by n - Tbe latter condition is fulfilled for a geometrically 
distributed Ti(z p J, i.e. P{Ti(a; p J = j} = p n { 1 - p n ) J ' _1 . 

Lemma 2.1. Let conditions of Theorem 12.11 be satisfied. Then it follows 

lim Pn ET*(x p J = 1/0 3 . (15) 

n—> oo 

Since p n ~ r/n according to (J3J) , the expression (TT5D specifies ([5]). 


3. Proofs 


3.1 Proof of Theorem 12.11 

It follows from ([2]) that 

P{T*[u n ) = n} = P{M n _i < u n ,X n > u n } 

= P{M n _i < u n } - P{M n < u n }. 

We obtain due to ([ID and the stationarity of {X n } that it holds 


(16) 


P/TxiUn) = n} = P{Mi t n < U n ,X n+ 1 > u n \X 1 > u n } 

= (P { Mi n T U ni X n .|_i > U n ]• P{M n S Uni Xn+1 ^n}) / ^{^"l M n } 

= (P{M n _i < X n+ i > «„} - P{M n < u n , X n+l > w n }) /P{Xi > u n } 
= (P{T*(u n ) =n}~ P{T*(u n ) = n + 1}) /P{X 1 > u n }. (17) 

Following Ferro and Segers (2003) we get alternatively for n > 1 

P{TiK) > n} = P{M 1}U+1 < u n \X 1 > u n } 

= (P{Mi )fl+ i < w n } - P{M n+ i < «„}) /P{A"i > u n } 

= (P {M n < u n } - P{M n+ 1 < W n }) /P{Xi > W n } 

= P{T*(w n )=n + l}/P{X 1 >ti n }. 

Thus, we get 

P{T"(x p J = n + 1} = P{A' 1 >^„}.P{T 1 (x p „)>,i} 

oo 

= pn 57 P { T l( X P») = 
z=n+l 
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(18) 




where p n = P{X\ > x Pn }. Now we use the fact followed from fllTD in Theorem 
11.11 i.e. it holds 


c n P{T\(x Pn ) = i} ~ rj n (l - i] n y 1 


(19) 


as n ->• oo, where 1 - rj n = (1 - p n ) e , c n = r) n /( 1 - (1 - r/n) 1/0 ), 0 < r? n < 1. 

Furthermore, we obtain 




n + 1} - 




OO OO 

— c n P{Ti(u n ) = f} ~ — Y r/ n (l 

i=n -\-1 2 =n+l 

^(1 - »„)" — — (1 — (1 — ,„)'/») (1 - v, 
C n Vn 

Pn(^ ~ @Pn) Pn Q \n Pn -Or 

T -^— J) =-(l-e Pn ) ~ T e . 


Vn) 


i—1 


n 


( 20 ) 


since (1 - p n f = 1 - 8p„ + o(p„) and lim„(l - 0p n ) n = e ® T as p n ~ r/n, 

n ^ oo. Formula (1201) implies both (fl3ll rewriting it with regard to P{T*(x Pn ) = 
n + l}/p n and (fl4l) replacing n + 1 by j. 

Hence, we have from (l20l) 

p { T *{x Pn )=]} ~ ^p n 0(l - p n ey~\ n-> oo. (21) 


The similarity in the first string of (1201) follows from (TT2li and since 


supr fe (n) = sup Y P{T 1 (x Pn )=i} 

n n i=\k/ Pn \ 

OO 

= sup Y - p { T i(x Pn ) = i}< sup ^E{T]{x Pn )) -> 0 

n i=\_k/ Pn \ l n K 


holds as k —> oo. 


3.2 Proof of Lemma 12.11 

Using (fl8i) and (fT9l) we find the expectation of the first hitting time 


OO OO 


ET'(x „J = J]iP{r*(r,J=i} = J]ip„J]P{T 1 (r p J = i} 


1 = 1 

OO oo 

■P 


j =1 i=j 


^ ^ c n r] n ^ 


. Cn 

j=l 

Pn 


X^'Pn(l-Pn) J 1 


Pn 


P* (1-(1-Pn) e ) 3 ' 

The similarity follows from the same arguments as before. Then (TT5li follows. 
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